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DETAILED ACTION 

1 . A request for continued examination under 37 CFR 1.114, including the fee set forth in 
37 CFR 1.17(e), was filed in this application after final rejection. Since this application is 
ehgible for continued examination under 37 CFR 1.1 14, and the fee set forth in 37 CFR 1.17(e) 
has been timely paid, the finality of the previous Office action has been withdrawn pursuant to 
37 CFR 1.114. Applicant's submission filed on 10/21/2004 has been entered. 

Claim Rejections - 35 USC §103 

2. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

3. Claims 1-4,6,9-16,18-23,25,28-34,36-41, and 43 rejected under 35 U.S.C. 103(a) as being 
unpatentable over Kanevsky et al (6529871) in view of Gould et al (6839669) . 

Regarding claims 1&13, Kanevsky et al. disclose a method and system that is comprised 
of a speech-processing system/ apparatus/device/software (collection of methods) that resides in 
the memory of a computer system (Col 5, Line 50 - Col 6, Line 7). Kanevsky et al. also 
discloses the building of user databases [claimed PIM database] and models by using an 
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enrollment process that collects personal information (e.g. name, address, etc) from the user (Col 
8, 30 51). The system contains non-acoustic and acoustic models (language models) for many 
users (Col 7, Line 62 and Col 8, Line 64). Kanevsky et al. disclose a learning algorithm for 
building user model that include the response to queried information from operators, 
informational forms, email forms, web based forms or Integrated Voice Response (IVR) systems 
(Col 7, Line 64 - Col 8, Line 7). This data is stored together with the acoustic models (language 
models) collected by a speech recognition system (Col 8,Line 65), When the user calls the 
central server to gain access, the users acoustic features extracted by an automatic speech 
recognition (ASR) application and the non-acoustic data (personal information data) in the 
database is used to help in the determination of the identity of the speaker (Col 6, Lines 12 -56). 
However, Kanevsky et al (6529871) does not explicitly teach using the language models for use 
in an automatic speech recognizer based on the PIM data, however, Gould et al (6839669) 
teaches the use of language models in PIM data (col. 1 line 59 - col. 2 line 26), wherein the 
spoken utterances by the user are speech recognized (col 3 lines 10-24), wherein a speaker 
profile is developed (col. 9 lines 1-21), and the language models associated with the user (col. 66 
lines 30-50; col. 77 lines 9-30)(also note -> and col. 66 - 77 contains more details with respect to 
the speech recognition process). Therefore, it would have been obvious to one of ordinary skill 
in the art of speech processing to modify the teachings of Kanevesky to include PIM data 
feedback of a user's speech recognition features because it would advantageously improve the 
speech recognition process (Gould, col. 1 lines 43-56). 

Regarding claim 19, Kanevsky et al. disclose a speech-based processing system 
comprising of a database of PIM data associated with a user (Col 6, Line 36 - 40); a set of 
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language models (Col 7, Line 60); a language model server including a learning unit [learning 
algorithm] (Col 8, Line 33-52), a lookup unit (Col 7, Lines 15-40); a recognition server to 
recognizer utterance of the user (Col 8, Line 13) and a speech application to trigger operation of 
the look-up unit to identify and access a subset of the PIM data specified by the utterance using 
the result of recognizing the utterance(Col 7, Lines 30 - 40). However, Kanevsky et al 
(6529871) does not explicitly teach using the language models for use in an automatic speech 
recognizer based on the PIM data, however, Gould et al (6839669) teaches the use of language 
models in PIM data (col. 1 line 59 - col. 2 line 26), wherein the spoken utterances by the user are 
speech recognized (col. 3 lines 10-24), wherein a speaker profile is developed (col. 9 lines 1-21), 
and the language models associated with the user (col. 66 lines 30-50; col. 77 lines 9-30)(also 
note ^ and col. 66 - 77 contains more details with respect to the speech recognition process). 
Therefore, it would have been obvious to one of ordinary skill in the art of speech processing to 
modify the teachings of Kanevesky to include PIM data feedback of a user's speech recognition 
features because it would advantageously improve the speech recognition process (Gould, col. 1 
lines 43-56). 

Regarding claim 28, Kanevsky et al. disclose a speech-based processing system 
comprising of a processor means for executing software (Col 5, Line 64 - Col 6, Line 2); and a 
storage means having stored therein a leaming unit to leam a set of language models based on a 
set of PIM data and a speech application to access a subset of the PIM data specified by a short 
reference to set to subset uttered by a user(Col 7, Lines 30 - 40). However, Kanevsky et al 
(6529871) does not explicitly teach using the language models for use in an automatic speech 
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recognizer based on the PIM data, however, Gould et al (6839669) teaches the use of language 
models in PIM data (col. 1 line 59 - col. 2 line 26), wherein the spoken utterances by the user are 
speech recognized (col. 3 lines 10-24), wherein a speaker profile is developed (col. 9 lines 1-21), 
and the language models associated with the user (col. 66 lines 30-50; col. 77 lines 9-30)(also 
note and col. 66 - 77 contains more details with respect to the speech recognition process). 
Therefore, it would have been obvious to one of ordinary skill in the art of speech processing to 
modify the teachings of Kanevesky to include PIM data feedback of a user's speech recognition 
features because it would advantageously improve the speech recognition process (Gould, col. 1 
lines 43-56). 

Regarding claims 30 & 37, Kanevsky et al. disclose a method of facilitating speech 
recognition comprising of using an automated language model learning process to acquire a set 
of language models based on PIM data associated with a user (Col 8, Lines 35 - 51); recognizing 
an utterance by the user by using one of the language model(Col 8, Lines 52 -66); and using the 
recognized utterance of the user to identify and access a subset of the PIM data(Col 9, Line 2). 
However, Kanevsky et al (6529871) does not explicitly teach using the language models for use 
in an automatic speech recognizer based on the PIM data, however, Gould et al (6839669) 
teaches the use of language models in PIM data (col. 1 line 59 - col. 2 line 26), wherein the 
spoken utterances by the user are speech recognized (col. 3 lines 10-24), wherein a speaker 
profile is developed (col. 9 lines 1-21), and the language models associated with the user (col. 66 
lines 30-50; col. 77 lines 9-30)(also note ^ and col. 66 - 77 contains more details with respect to 
the speech recognition process). Therefore, it would have been obvious to one of ordinary skill 
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in the art of speech processing to modify the teachings of Kanevesky to include PIM data 
feedback of a user's speech recognition features because it would advantageously improve the 
speech recognition process (Gould, col. 1 lines 43-56). 



Regarding claims 2,20,31 & 38, Kanevsky et al. disclose the method of asking the user 
random questions [claimed short references] derived from the personal database. The data will 
include portions or a subset of the data of the personal information data residing in the PIM/user 
database (Col 6, Lines 41-45). 

Regarding claims 3,15,22,33 & 40, Kanevsky et al. disclose that a new user may 
periodically call the central server where the leaming algorithm may be activated. The system 
collects voice samples from the caller's answers to the plurality of questions and builds a user 
voice model which will be provided to the system the next time the speaker calls and needs to be 
identified (Col 8, Line 64 - Col 9, Line 5). 

Regarding claims 4,16,23,34 & 41 Kanevsky et al. disclose that a user may call the 
central server where the leaming algorithm may be activated in real-time to help in the 
determination of the identity of the user by the use of stored language models based on the 
leaming algorithm (Col 7, Lines 28 -33). 



Application/Control Number: 09/813,711 Page 7 

Art Unit: 2655 

Regarding claims 6,18,25,36 & 43 Kanevsky et al. disclose that the language model 
leaming algorithm is based on a statistical language model. Kanevsky describes that a user 
model provided by the leaming unit is used to determine the probability of a particular user's 
identity (Col 11, Lines 3 - 35). 

Regarding claims 9, 10, 1 1,12,14,29,21,32 & 39, Kanevsky et al. disclose that the 
leaming algorithm takes data form a pluraUty of sources, each module containing a set of 
heuristics tailored for acquiring language models for one of a plurality of types of PIM data. The 
data comprises of personal address book, personal calendar and email message data. Kanevsky et 
al. describes categories of personal information data. The categories include static features which 
includes phone number, time of day, etc (personal address book data), dynamic features which 
include trips, meetings (personal calendar data) and email, faxes, etc. The data also includes 
intemal information extracted from dialog that includes gender, speech rate, accent etc. and 
extemal information such as name, address, DOB, etc (Fig 4, Col 10, Lines 42 - 59). 

4. Claims 5,17,24,35 & 42 are rejected under 35 U.S.C. 103(a) as being unpatentable over 
the combination of Kanevsky et al. (U.S. Patent 6529871) in view of Gould et al (6839669) in 
further view of Kremer (IEEE 0-7803-4122-8/97). 



Regarding claims 5,17,24,35 & 42, the combination of Kanevsky et al. (U.S. Patent 
6529871) in view of Gould et al (6839669) discloses a method and system that is comprised of a 
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speech-processing system/ apparatus/device/software (collection of methods) that resides in the 
memory of a computer system (Col 5, Line 50 - Col 6, Line 7). The system uses personal data to 
choose language models base on input speech. The combination of Kanevsky et al. (U.S. Patent 
6529871) in view of Gould et al (6839669) does not disclose the use of a language model 
algorithm that utilizes grammar induction. However, Kremer teaches a grammar induction 
technique that avoids the limitations of classical grammar induction algorithms (Page 1424, 
Paragraph 1 & 2). Granmiar induction is a necessary process by which the language models 
based on a finite set of example strings from the PIM data will be used to identify grammar, for a 
potentially infinite set of strings or language. Therefore, it would have been obvious to those 
skilled in the art at the time of the invention to modify the combination of Kanevsky et al. (U.S. 
Patent 6529871) in view of Gould et al (6839669) with the use of language models that utiUze 
grammar induction as taught by Kremer since it is essential for language models utilized in 
automatic speech recognition systems. (Kremer, page 1424, first paragraph). 

5. Claims 7,8,26,27,44 & 45 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Kanevsky et al. (U.S. Patent 6529871) in view of Gould et al (6839669) in 
fiirther view of Fitzpatrick et al. (U.S. Patent Application 20020184033). 

Regarding claims 7,26 & 44, Kanevsky et al. (U.S. Patent 6529871) in view of Gould et 
al (6839669) discloses a method and system that is comprised of a speech-processing 
system/apparatus/device/software (collection of methods) that resides in the memory of a 
computer system (Col 5, Line 50 - Col 6, Line 7). The system uses personal data to choose 
language models base on input speech: Kanevsky do not disclose the use of an API designed to 
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access PIM data. However, Fitzpatrick et al. teach the use of Standard Application Program 
Interface (API) for integrating other Text-to-Speech Servers (Page 4, Paragraph 0116). APIs are 
important and necessary for developers to efficiently integrate large-scale software applications 
and modules. Therefore, it would have been obvious to those skilled in the art at the time of the 
invention to modify the combination of Kanevsky et al. (U.S. Patent 6529871) in view of Gould 
et al (6839669) with the use of APIs as taught by Fitzpatrick et al. since it would have made the 
integration of the learning modules with other applications more efficient. (Fitzpatrick, page 4, 
paragraph 01 16). 

Regarding claims 8,27 & 45, Kanevsky et al. (U.S. Patent 6529871) in view of Gould et 
al (6839669) does not disclose the use of providing the PIM data in XML format. However, 
Fitzpatrick et al. teach the use of providing documents in XML format (Page 6, Paragraph 0175). 
XML is a general and highly flexible representation of any type of data. XML uniform tags make 
it easy to map one XML structure to another or to map XML into other data formats. Therefore, 
it would have been obvious to those skilled in the art at the time of the invention to modify the 
combination of Kanevsky et al. (U.S. Patent 6529871) in view of Gould et al (6839669) with the 
use of XML as taught by Fitzpatrick et al. since it would have made the data more uniform so 
that it can be transferred and mapped to other types of data. 

Response to Arguments 

6. Applicant's arguments with respect to claims 1-45 have been considered but are moot in 
view of the new ground(s) of rejection. 
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Conclusion 



7. Any response to this action should be mailed to: 

Commissioner of Patents and Trademarks 
Washington, D.C. 20231 
or faxed to: 
(703) 872 9314, 

(for informal or draft communications, please label "PROPOSED" or 
"DRAFT") 

Hand-delivered responses should be brought to Crystal Park II, 2121 
Crystal Drive, Arlington. VA., Sixth Floor (Receptionist). 

8. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Michael Opsasnick, telephone number (703)305-4089, 
who is available Tuesday-Thursday, 9am-4pm. 

If attempts to reach the examiner by telephone are unsuccessfiil, the examiner's 
supervisor, Ms. Doris To, can be reached at (703)305-4827. The facsimile phone number 
for this group is (703)872-9306. 

Any inquiry of a general natixre or relating to the status of this application or 
proceeding should be directed to the Group 2600 receptionist whose telephone number is 
(703) 305-4750, the 2600 Customer Service telephone number is (703) 306-0377. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or PubUc PAIR. Status 
information for unpublished applications is available through Private PAIR only. For 
more information about the PAIR system, see http://pair-direct.uspto.gov. Should you 
have questions on access to the Private PAIR system, contact the Electronic Business 
Center (EBC) at 866-217-9197 (toll-free). 
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